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Abstract 

Eukaryotic mRNA decay is a highly regulated process allowing cells to rapidly modulate protein production in response to 
Internal and envlronnnental cues. Mature translatable eukaryotic mRNAs are protected from fast and uncontrolled 
degradation in the cytoplasm by two cis-acting stability determinants: a methylguanosine {m''G) cap and a poly(A) tail at 
their 5' and 3' extremities, respectively. The hydrolysis of the m^G cap structure, known as decapping, is performed by the 
complex composed of the Dcp2 catalytic subunit and its partner Dcpl. The Dcpl-Dcp2 decapping complex has a low 
intrinsic activity and requires accessory factors to be fully active. Among these factors, Patl is considered to be a central 
scaffolding protein involved in Dcp2 activation but also in inhibition of translation initiation. Here, we present the structural 
and functional study of the C-terminal domain from S. cerevisiae Patl protein. We have identified two conserved and 
functionally important regions located at both extremities of the domain. The first region is involved in binding to Lsml-7 
complex. The second patch is specific for fungal proteins and is responsible for Patl interaction with Edc3. These 
observations support the plasticity of the protein interaction network involved in mRNA decay and show that evolution has 
extended the C-terminal alpha-helical domain from fungal Patl proteins to generate a new binding platform for protein 
partners. 
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introduction 

Protein synthesis in eukaryotic cells is finely tuned both by 
controlling mRNA levels and their translation. The steady-state 
level of a given mRNA is governed by its rates of transcription and 
degradation. Hence, the regulation of mRNA decay provides a 
rapid mechanism to adapt to changing conditions. Eukaryotic 
mature mRNAs are protected from fast and uncontrolled 
degradation in the cytoplasm by two main CM-acting stability 
determinants: a methylguanosine (m G) cap and a poly(A) tail 
present at their 5' and 3' extremities, respectively. Bulk mRNA 
decay is initiated by 3' poly(A) tail shortening followed by 
degradation via either the 3'^5' or 5'^3' mRNA decay 
mechanisms [1]. In the 3 '^5' mRNA decay, the mRNA is 
degraded by the exosome, a multiprotein complex with both exo- 



and endo-nucleolytic activities embedded within a single protein 
Dis3 [2,3,4,5,6]. This generates a 5' capped ohgonucleotide 
byproduct (less than 1 0 base long) that is further degraded by the 
scavenger protein DcpS (Dcsl in budding yeast) to release 
m'GMP [7]. In the 5'^3' pathway, the 5' cap is eliminated by 
a critical and highly regulated step known as decapping, which is 
followed by the rapid digestion of the mRNA body from its 5' end, 
a process mediated by the exonuclease Xrnl [8,9]. 

The decapping step consists in the hydrolysis of the cap 
structure generating m'ODP and a 5' phosphorylated RNA 
molecule. This reaction is performed by the decapping holoen- 
zyme composed by the catalytic subunit Dcp2 and its co-factor 
Dcpl [10,11,12]. This complex has a low intrinsic decapping 
activity and requires one or several accessory factors (such as 
Lsml-7, Patl or Edcl-4 proteins) to be fuUy efficient [13,14]. In 
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particular, deletion of the LSMl-7 genes results in mRNAs 
stabilization associated with accumulation of deadenylated capped 
mRNAs [15,16]. These genes encode for Lsm proteins, which 
associate to form the heteroheptameric Lsm 1-7 complex. This 
complex binds preferentially to deadenylated or oligoadenylated, 
but not polyadenylated, mRNAs [17,18] and interacts with Patl 
[15,16,19]. Edc3 is another activator of decapping but EDC3 
deletion does not result in the general accumulation of capped 
mRNA decay intermediates. Edc3 contains an Sm/Lsm domain 
and interacts with the yeast DEAD-box helicase Dhhl (RCK/ 
p54/DDX6 in mammals, MeSlB in fruit fly) and Patl (Pat lb in 
mammals, HPat in fruit fly) [20,21,22,23]. Edc3 was also proposed 
to associate to Dcp2 via its Lsm domain and to stimulate thereby 
its enzymatic activity in vitro [24,25,26]. Finally, in fungi Edc3 was 
shown to regulate the TRAl pre-mRNA level and to interact with 
the ribosomal protein Rps28 participating thereby to an autoreg- 
ulatory loop controlling the decapping of mRNAs encoding this 
protein [27,28,29,30]. Additional decapping activators are specilic 
to fungi or metazoans. In yeast, this includes the RNA-binding 
proteins Edcl and Edc2 that stimulate Dcp2 activity in vitro [31]. 
In contrast, metazoan EDC4 (also known as Ge-1 or Hedls 
proteins) stimulates DCP2 by promoting its association with DCPl 
[32,33]. 

The deletion of the PATI gene in S. cerevisiae results in a 
thermosensitive growth phenotype associated with a strong 
decapping inhibition in vivo [15,16,19]. These observations 
revealed that Patl is a critical component of the decapping 
machinery'. As Pat 1 interacts with numerous mRNA decay factors 
including the Lsml-7 complex, the 5'^3' exonuclease Xrnl, the 
DEAD-box helicasi; Dhhl, Edc3, and Dcp2 in yeast and other 
organisms [15,16,19,21,22,23,26,34,35], it appears to function as a 
scafibld protein allowing these proteins to form a multisubunit 
assembly that mediates regulated mRNA decapping. Patl is 
composed of several regions (domains N for N-terminal, M for 
Middle and C for C-terminal) that are responsible for interaction 
with various partners and that are implicated in distinct Patl 
functions. The interacting network formed by Patl from yeast, 
drosophUa and human and its partners has been dissected using 
various strategies: co-immunoprecipitation of proteins expressed in 
insect cells, interaction analysis using recombinant S. cerevisiae 
proteins or domains, and yeast two-hybrid assays [21,26,34,36,37]. 
Dhhl proteins from the three species interact with the N-terminal 
domain of their Patl counterparts and this interaction was 
proposed to modulate the interaction of Dhhl with RNA 
[20,22,34,35,36]. Other common partners of Patl in these three 
species include the Lsml-7 protein complex and subunits of the 
decapping complex (containing Dcpl, Dcp2, and in addition 
EDC4 in mammals; [38]). This is consistent with the requirement 
of Patl and the Lsm proteins for decapping in vivo [15,16,19] and 
stimulation of m iiitm decapping by the S. cerevisiae Patl C domain 
[26]. Recent structural studies have described the interaction 
mode between yeast Patl C domain and the Lsml-7 complex and 
shown that Patl interacts directly with the Lsm2 and Lsm3 
proteins within the Lsml-7 complex [39,40]. Other Patl partners 
related to mRNA decay and translational control have been 
observed in one or two of these three species. This includes 
subunits of the CCR4-NOT complex (human, drosophUa), Scd6 
(yeast), Edc3 (yeast, human) and Xrnl (yeast, human) 
[15,20,21,26,34,41]. At this stage, it is unclear whether some 
interactions are species specific neither whether all these proteins 
interact directly with Patl, especially as different assays were used 
[e.g., co-immunoprecipitation and two-hybrid assays versus inter- 
action between purified recombinant factors). Patl proteins are 
also nucleic acid binding proteins. Indeed, PlOO, the X. hems Patl 



ortholog, binds to single-stranded but not double-stranded DNA 
(RNA was not tested; [42]). In addition, yeast Patl and human 
Pat lb domain C bind to poly(U) RNA but not poly(C) or poly(A) 
[21,36]. 

Altogether these proteins participate to an intricate network of 
protein-protein and protein-RNA interactions, which have been 
proposed to promote decapping by successive steps. A model 
suggests that Dhhl, Patl and Scd6 first repress translation and 
inhibit formation of the 48S initiation complex [26]. This is 
supported by the translational repression induced upon over- 
expression of Patl and Dhhl [43,44]. Second, remodeling of the 
mRNP particles leads to recruitment of new decapping factors and 
loss of eIF4E and eIF4G initiation factors, which were bound to 
the cap [45]. Finally, the Patl and Edc3 proteins as well as the 
Lsml-7 complex activate the Dcp2 catalytic subunit [15,16,26]. 
However, independent experiments suggest that decapping can 
also occur on translated mRNAs [46]. Patl and its partners co- 
locahze to cytoplasmic foci known as P-bodies and most of these 
proteins are important for formation of these mRNP granules 
[47,48,49,50]. These P-bodies contain few translation initiation 
factors and are assumed to be dedicated to the storage of 
translationally repressed mRNAs as well as to the decapping and 
degradation of mRNAs. 

In this paper, we have focused our attention on the S. cerevisiae 
Patl C domain (ScPatlC), which is required for dc-capping and 
involved in interaction with Lsml-7 complex, Xrnl, Edc3, Dcpl 
and Dcp2. We have determined its crystal structure and identified 
two functionally important regions located at both extremities of 
this domain that are important for the recruitment of dilferent 
decapping factors. 

Materials and Methods 

Yeast strains and shuttling plasnnid 

Yeast strains used in this study are listed in Table S 1 . Multiple 
mutant strains were obtained by crossing and tetrad dissertion 
following standard genetic procedures. Strains BSYl 133 with the 
deletion oi PATI gene and YFW168 with deletions oi PATI and 
DHHl genes were kindly provided by Michele Minet. 

Shuttling plasmids for complementation analyses are listed in 
Table S2. Briefly, the yeast PATI gene was amplified from yeast 
genomic DNA using oligonucleotides OBS4954 and OBS4955 
(Table S3). The PGR product was cloned into XhoI/BamHI sites 
of pRS414 [5 1] giving pBS4357. Plasmids bearing PATI gene with 
deletions or point mutations were obtained using the Quikchange 
strategy on the pBS4357 template. Oligonucleotides used in this 
study are listed in Table S3. 

Cell growth conditions 

For complementation analyses, cells were grown to the 
logarithmic phase in synthetic complete (SC) medium lacking 
tryptophan complemented with 2% glucose. Cultures were diluted 
to an optical density of 0. 1 at 600 nm (ODgoo) with sterile water. 
Three |J,L of each strain as well as three |J.L of three consecutive 
10-fold serial dilutions were plated on SG medium lacking 
tryptophan. CeU growth rate was monitored at 25°G, 30°C and 
37°G after 48 or 72 hours in case of the Adhhl / Apatl strain. 

Two-hybrid interaction analysis 

Two-hybrid interaction analyses were done using standard 
procedure in two strains MAV203 (Invitrogen) and the isogenic 
BSY2475 lacking Edc3 [30]. The P-galactosidase activity was 
measured using Beta-Glo Assay system (Promega). Two hybrid 
plasmids were constructed by standard cloning techniques after 
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PGR amplification of the coding sequences. The plasmid 
pBS4907, expressing PATI gene lacking C-terminal 68 amino 
acids fused to Gal4AD, was obtained by the QuickChange strategy 
using OBS5012 and OBS5013 and pBS2374 as a template. 
Oligonucleotides used in this study are listed in Table S3. 

Cloning, expression and purification of PatIC proteins 

The coding sequence of Patl C-terminal domain (hereafter 
named ScPatlC encompassing residues 473 to 796) was amplified 
from yeast Saccharomjces cerevisiae S288C genomic DNA with 
oligonucleotides 0MGI8/0MGI8O (sec Tabk; S3) and inserted 
into pET21-a vector with a sequence coding for a Hisg tag at the 
3' end of the coding sequence, yielding plasmid pMGSll. 

The ScPatlC protein (both unlabeled or Se-Met labeled) was 
expressed in E. coli Rosetta (DE3) pLysS strain (Novagen) in 2YT 
medium supplemented with ampicillin at 100 |j.g/mL and 
chloramphenicol at 25 |ig/mL. At OD6oo = 0.8, the protein 
expression was induced during 20 h at 28°C by adding 50 |a,g/ 
mL IPTG. Cells were harvested by centrifugation and resus- 
pended in 30 mL of buffer A (20 mM Tris-HCl pH 7.5, 200 mM 
NaCl). Cell lysis was performed by sonication. The Hisg tagged 
protein was purified on NiNTA column (Qiagen) followed by a 
heparin column (GE Healthcare) and finally on Superdex 200 16/ 
60 size exclusion column (GE Healthcare) in buffer A. 

The DNA sequences coding for the region encompassing 
residues 435 to 796 from Saccharomyces cerevisiae Patl mutants were 
amplified from the complementation plasmids pBS4436 (mutant 
Q,706A/L713A), pBS4437 (mutant K475E/K476E), pBS4438 
(mutant K531E/K534E/R538E), pBS4439 (mutant R497E) and 
pBS4440 (mutant Q.720A/R721A/D725A/R728A) using oligo- 
nucleotides 0MGII9/0MGII8 (see Table S3). They were sub- 
cloned into pET21-a with a sequence coding for a N-terminal Hisg 
tag to yield expression plasmids pMG606 (mutant Q706A/ 
L713A), pMG607 (mutant K475E/K476E), pMG608 (mutant 
K531E/K534E/R538E), pMG609 (mutant R497E) and pMG6 10 
(mutant Q720A/R72 1 A/D725A/R728A). 

All ScPatl [435-796] mutants were expressed in E. coli 
BL21(DE3) Codon"" (Novagen) in IL of 2YT medium supple- 
mented with ampicillin at 100 |ig/mL and chloramphenicol at 
25 |J,g/mL. At ODfioo = 0.8, protein expression was induced at 
20°C for 20 h by adding 50 ng/mL IPTG. Cells were harvested 
by centrifugation and resuspended in 40 mL of buffer B (20 mM 
Tris-HCl pH 8, 200 mM NaCl) supplemented widi 20 mM 
imidazole. Cell lysis was performed by sonication. 

These mutants were purified on a NiNTA column (Qiagen), 
and then on a Superdex 75 16/60 size exclusion column (GE 
Healthcare) equilibrated in buffer B. 

Cloning, expression and purification of the Lsm1-7 
complex 

A plasmid containing an operon made of the partial (Lsm4, 
Lsm7) or complete (Lsml-3, Lsm5-6) coding sequences of yeast 
Lsml-7 proteins was prepared by amplifying individually each 
coding sequence containing a ribosome binding site upstream of 
each start codon and unique restriction sites at both ends using 
oligonucleotides OBS6352 to OBS6355 and oLsmXF/oLmsXR 
(Tabic S3). In addition, scqiu'iices coding the His^ tag and a TEV 
cleavage site were fused to the N-terminus of the Lsml protein. 
The seven coding sequences were assembled in a stepwise manner 
in the pBS3021 backbone giving plasmid pBS5031. 

The Lsml-7 complex from S. cerevisiae was expressed using 
pBS5031 plasmid in E. coli BL21(DE,3) Gold (Stratagene) in 1 L of 
autoinducible medium at 37°C supplemented with kanamycin at 
100 ng/mL. Cells were harvested by centrifugation and resus- 



pended in 35 mL of buffer C (20 mM Tris-HCl pH 7.5, 500 mM 
NaCl, 25 mM imidazole, 5 mM 2-mercaptoethanol). Cell lysis 
was performed by sonication. The complex was purified on a 
NiNTA column (Qiagen). Eluted Lsml-7 proteins were incubated 
overnight with TEV protease in dialysis buffer (20 mM Tris-HCl 
pH 7.5, 150 mM NaCl, 0.5 mM EDTA, 5 mM 2-mercaptoeth- 
anol). The His-tagged TEV protease and uncleaved complex were 
removed by incubation with NiNTA resin. The untagged Lsml-7 
complex was present in the ffow-through and further purified by 
ion exchange chromatography at pH 7.5 (HiTrap Q FF, GE 
Healthcare) followed by size-exclusion chromatography on Super- 
dex 200 16/60 column (GE Healthcare) in buffer D (20 mM Tris- 
HCl pH 7.5, 150 mM NaCl, 5 mM 2-mercaptoethanol). 

Pull-down assays 

Pull-down experiments were performed by mixing 52.5 [ig of 
Hisg tagged Patl [435-796] with 2 fold molar excess amounts of 
untagged Lsml-7 complex. Binding buffer (20 mM Tris-HCl 
pH 7.5, 100 mM NaCl, 50 mM imidazole, 10% Glycerol) was 
added to a final volume of 60 |a,L. The reaction mixtures were 
incubated on ice for 2 hours. 10 ^L was withdrawn to constitute 
the input fraction. The rest was incubated with 500 |J,g of HisPur 
Ni-NTA Magnetic Beads (Thermo Scientific) in a final volume of 
200 [lL at 4°C for 2 hours. Beads were washed three times with 
500 [iL of binding buffer. Bound proteins were eluted with 
250 mM imidazole. Samples were resolved on SDS-PAGE and 
visualized by Coomassie blue staining. 

Crystallization and structure determination 

Needle crystals were initially obtained for the ScPatlC domain 
in a wide range of conditions from two commercial kits (MB Class 
I and Procomplex from Qiagen) and some conditions were 
optimized to obtain diffracting crystals. All the crystals were cryo- 
protected by transfer into their crystallization condition with 
progressively higher ethylene glycol concentration up to 30% and 
then flash-cooled in liquid nitrogen. All datasets were collected on 
beam-line Proxima-1 (Synchrotron SOLEIL, Saint- Aubin, 
France). 

A first 4 A resolution dataset (crystal form I) was collected at Se- 
edge from a crystal of Se-Met labeled protein obtained by mixing 
1 nL of ScPatlC at 17 mg/mL in buffer A with an equal volume 
of crystallization solution (4% PEG 8,000; 0.1 M Tris-HCl 
pH 7.5). This dataset was processed with the MOSLEM and 
SCALA programs ([52,53], Table 1). This crystal belongs to space 
group P2i with two or three molecules in the asymmetric unit, 
corresponding to a solvent content of 63'% or 44%, respectively. 
The ScPatlC structure was solved by Se-SAD using this dataset. 
The SHELXD program was used to find an initial set of 12 Se 
sites in the 48-5 A resolution range [54] . Refinement of the Se sites 
and phasing were carried out with the SOLVE program [55]. 
Solvent flattening and NCS averaging were performed with the 
program RESOLVE [56]. The best experimental electron density 
maps were obtained for a solvent content of 65%, indicating that 
the asymmetric unit only contains two copies of the ScPatlC 
domain. Although of poor quality due to the limited resolution, 
these experimental maps allowed the building of several poly-Ala 
alpha-helices. 

In parallel, two additional datasets (crystal forms II and III) 
could be collected from crystals of native protein (10 mg/mL in 
buffer A) grown in 20% PEG 4,000; 50 mM sodium citrate; 5% 
isopropanol and 15% PEG 6,000; 7.5% MPD; 0.1 M MES 
pH 6.5, respectively. These datasets were processed with the XDS 
package [57]. Crystal form II belongs to space group P2i2i2i with 
three ScPatlC copies in the asymmetric unit and yielded a 2.3 A 
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Table 1. Data collection and refinement statistics. 






Crystal form 1 (Se-Met) 


Crystal form II 


Crystal form III 


Data collection 


Space group 


P2, 


P2,2,2, 


P2, 


Cell dimensions 


0, b, c (A) 


112.6, 37.6, 116.8 


36.3, 173.15, 175 


55.2, 88.6, 68.1 


oe, P, y n 


90, 99, 90 


90, 90, 90 


90, 96.1, 90 


Resolution (A) 


48-4.0 (4.22-4.0) 


30-2.3 (2.41-2.3) 


50-2.15 (2.27-2.15) 


Total number of reflections 


92,290 


189,218 


142,169 


Total number of unique reflections 


8,604 


49,604 


35,407 


■■sym 


0.113 (0.53) 


0.054 (0.56) 


0.081 (0.57) 


;/a; 


11.8 (5.3) 


15.7 (2.4) 


11.3 (2.1) 


Completeness (%) 


99.6 (100) 


99.6 (98.2) 


98.9 (95.8) 


Redundancy 


10.7 


3.8 


4 


Refinement 


Resolution (A) 




30-2.3 


50-2.15 


Rwork/Rfree (%)" 




24/30.4 


21/26.5 


R.m.s. deviations 


Bond lengthis (A) 




0.012 


0.008 


Bond angles { ') 




1.45 


1.064 


PDB code 




40JJ 


40GP 



"^i^factor^X l|Fol " |Fc||/|Fo|. Rfree was Calculated with a small fraction (5%) of randomly selected reflections. 
doi:l 0.1 371/journal.pone.0096828.t001 



resolution dataset. Crystal form III diffracted to 2.15 A resolution 
and belongs to space group P2i with 2 ScPatlC molecules in the 
asymmetric unit. Statistics on data collection and refinement are 
gathered within Table 1 . The dataset collected from crystal form II 
was then used for molecular replacement trials with the MOLREP 
program [58] to locate 3 copies of the poly- Ala model built in the 
4 A resolution experimental map (crystal form I). Successive cycles 
of buUding/refmement with the COOT [59] and PHENIX.RE- 
FINE [60] programs yielded a final model with R and RfVee values 
of 24% and 30.4%, respectively. The final model obtained from 
crystal form II is incomplete due to the lack of electron density for 
certain protein regions, most probably due to their intrinsic 
flexibility. The final model contains residues Gly474 to Tyr787 
from monomer A, Glu480 to Ile483, Phe523 to Gln694, Asp700 
to Leu 795 from monomer B and Lys476 to Lys796 (as well as the 
three first residues from the His-tag) from monomer C. In 
addition, one ethylene glycol molecule from the cryo-protection 
solution, one magnesium ion, one chloride ion and 231 water 
molecules were modeled in the 2Fo-Fc electron density map. 

The dataset collected from crystal form III was also used to 
refine the structure of ScPatlC domain. Two copies of the PatlC 
model were positioned by molecular replacement with MOLREP 
and refined using the PHENIX.REFINE program to reach final R 
and Rfree values of 21% and 26.5%, respectively (Table 1). The 
final model contains residues Gly474 to Ile644 and Leu654 to 
Lys796 to from monomer A, Gly473 to Ser648 and Leu654 to 
Lys796 from monomer B as well as 2 MES buffer molecules, 3 
ethylene glycol molecules from the cryo-protection solution and 
141 water molecules. 



Results 

Delineation of ScPatI C domain and structure 
determination 

As previous studies have emphasized on the role of ScPatl C 
domain (region 422-796) in translation repression [21], in 
reduction of 48S translation initiation intermediates accumulation 
[26] and in localization to P-bodies [21], we decided to determine 
its crystal structure to obtain information on its organization at the 
atomic level. Analysis of the sequence of the ScPatl C domain 
using bioinformatics tools [i.e. secondary structure prediction, 
HCA "Hydrophobic Cluster Analysis" tool [61]) and sequence 
conservation among yeast species revealed that the region 422-472 
is poorly conserved and is particularly enriched in Ser, Ala, Asn 
and basic residues (Lys/Arg), which are characteristics of 
unstructured regions (data not shown). On the opposite, the 
region encompassing residues 473 to 796 displays significant 
sequence identity within yeast species and is predicted to be well 
folded. We therefore cloned the ScPatlC (473-796) domain, over- 
expressed it in E. coli and purified it in milligram amounts to 
homogeneity. The SeMet labeled and unlabeled proteins crystal- 
lized in three difiFerent space groups (Table 1). Given the much 
lower resolution of the dataset collected from crystal form I SeMet 
protein crystals, this dataset was only used to obtain experimental 
electron density maps by the Single wavelength Anomalous 
Diffusion method (SAD) using the selenium anomalous signal 
but not for structure refinement. Only structures refined to higher 
resolution (crystal forms II and III) will be discussed in this 
manuscript. 

Three and two copies of the ScPatlC molecule are present in 
the asymmetric unit of crystal forms II and III, respectively. All 
these structures can be superimposed onto each other with rmsd 
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Figure 1 . Structure of 5. cerevisiae Pati C domain. A and B. Left panels: ribbon representation of ScPatI C crystal structure. The ARIVl repeats are 
depicted using different colors. IVIiddle panel: Mapping of the sequence conservation at the surface of the ScPatIC domain. Coloring is from grey (low 
conservation) to cyan (highly conserved). The conservation score was calculated using the CONSURF server [67] and using an alignment made from 
the sequences of 30 Pati fungal orthologues. Right panel: Mapping of the electrostatic potential at the surface of the ScPatIC domain. Positively 
(10 ksT/e') and negatively (-10 keT/e') charged regions are colored in blue and red, respectively. The electrostatic potential was calculated using 
PBEQ Solver server [68]. Orientation in B differs from A by a 180 rotation along the horizontal axis. C. Ribbon representation of the superimposition 
between human PatIC domain (green; [36]) and ScPatIC domain. The core conserved from yeast to human is colored in yellow. The fungi specific C- 
terminal extension from ScPatIC is colored in red. D. Surface representation of ScPatI domain. Same color code as panel C. 
doi:1 0.1 371/journal.pone.0096828.g001 



values ranging from 0.3 A to 1.2 A over 260-310 Ca atoms. The 
relatively high rmsd values observed between some molecules 
result from flexibility of both N- and C-terminal extremities of this 
domain (Fig.SlA). This flexibility is also supported by the 
observation that residues 484-524 from one protomer as well as 
residues 788-796 (corresponding to a P-sheet, see below) from 
another one are iU-defmed in the electron density maps and hence 
absent from the final model. In parallel, two independent 
structures of the ScPatIC domain bound to Lsml-7 complex 
[39] or Lsm2-Lsm3 subcomplex [40] were recently determined at 
3.7 A and 3.15 A resolution, respectively (rmsd values ranging 
from 0.9 A to 1.6 A with our structures of ScPatIC domain). In 
these structures, the C-terminal extremity of the ScPatIC domain 
exhibited higher flexibility than the remaining parts of the domain. 
This is illustrated either by the absence of the C-terminal fi-hairpin 
and of residues 743-764 (corresponding to the end of helix al4 



and the beginning of helix 0tl5) in the structure of the Patl-Lsml-7 
complex or by the high B-values of the residues 644-794 in the 
structure of the Patl-Lsm2-3 subcomplex compared to the other 
residues from this domain. Finally, the comparison of nine 
independent crystal structures has also highlighted flexibility of 
the N-terminal end of human Pat lb C-terminal domain [36]. This 
conserved characteristic suggests that flexible regions located at 
both ends of ScPatIC domain might be of functional importance. 

The crystal packing radically diflers between crystal forms II 
and III. In crystal form III, the two Pati molecules of the 
asymmetric unit exhibit a very intricate head-to-tail organization 
with an interacting surface area of 1,635 A^. As this value is in the 
range expected for interfaces within homodimers [62], this 
suggested that this ScPatIC domain could also exist as a 
homodimer in solution. To investigate this possibility, we have 
analysed the quaternary structure of ScPatIC in solution using 
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both size exclusion chromatography coupled online to a triple 
detection array (MALLS, Fig.SlB) and small angle X-ray 
Scattering (SAXS; Fig.SlC). MALLS measurements on a protein 
at 2 mg/mL yielded a molecular mass of 39.3 kDa, indicating that 
ScPatIC is monomeric in solution (the theoretical molecular 
weight for the monomer is 38.6 kDa). Furthermore, the SAXS 
experimental curves measured at various ScPatIC concentrations 
(5.4 and 11.2 mg/mL, i.e. concentrations closer to those used in 
the crystallization experiments) clearly correspond to the curve 
calculated from the coordinates of the ScPatIC monomer but not 
from those of the crystal dimer (Fig.SlC). This SAXS analysis 
indicates that: first, the large contact area observed in crystal form 
III is created by the crystal-packing and does not reflect a 
biologically relevant oligomeric state; second, ScPatIC adopts in 
solution a structure similar to that observed in the crystals. 

Saccharomyces cerevisiae PatIC is a member of the ARM 
repeat superfamily 

The structure of the C-terminal half of ScPatl is formed by 15 
a-helices followed by two C-terminal |3 strands. The a-helix 
packing of the protein adopts an a-a superhelix fold reminiscent of 
the ARM repeat superfamily. ScPatIC is organized as an array of 
5 repeats of 3 helices stacked against each other to form a solenoid 
structure' (Fig. lA— B). Within each repeat, the second and third 
helices are packed in an anti-parallel manner and the shorter first 
hehx is perpendicular to the two other helices [63]. Within the 
superhelix, the second and third helices from a repeat are stacked 
against the second and third helices from the neighboring repeats, 
respectively. The first helices from each repeat are all exposed to 
the same face of the solenoid. In the ScPat 1 C domain, the first 
hefix from repeat 1 is absent and the length of the ARM repeats 
ranges in length from around 40 to 70 amino acids. The fifth and 
final motif is capped at its C-terminal end by helix al5 followed by 
a two-stranded anti-parallel P-hairpin, which folds back onto 
helices 0{14 and al5. These two latter helices are long (25 to 30 
residues long) and protrude from the core of the protein. 

The ScPatIC domain adopts the same fold as the corresponding 
domain from human Patlb (rmsd value of 3.4— 3.8 A over 200 Ca 
atoms and 20% sequence identity. Fig. IC, [36]). The highest 
structural similarity is found in the region encompassing residues 
476 to 680 from yeast (520-732 from human Patlb). The C- 
terminal part of this domain (680-796 in yeast and 733-765 in 
human) is much more divergent and in particular, S. cerevisiae 
protein exhibits a significantiy longer C-terminal extension with 
two long a-helices (otl4, a 15) and a two stranded fi-hairpin 
(Fig. lA-B & D). This C-terminal extension, which also encom- 
passes helix a 12, is specific for fungi and varies in length among 
yeast species (Fig.2). 

Mapping of the sequence conservation at the surface of the 
ScPatIC reveals the presence of two highly conserved regions 
(Fig. lA-B). The first region is conserved both in fungal and in 
metazoan proteins and has very recentiy been shown to be direcdy 
involved in the interaction with the Lsml-7 complex [39,40]. It is 
mainly formed by residues located within helices txl (Lys475, 
Lys476, Glu483, Glu493 and Arg497) and a4 (Lys531, Lys534 
and Arg538) and displays a positively charged electrostatic 
potential (Fig. lA; Fig.2). The other conser\'ed region is located 
at the opposite side of the domain and corresponds to the fungi 
specific region (Fig. IB; Fig.2). This region is formed by the C- 
terminal end of ScPatIC (helices al3, 0(14, al5 and strands (31 
and P2) and is electrostatically neutral. 



The conserved positively charged N-terminal region from 

ScPatIC involved in the interaction with Lsm1-7 
complex, is important for yeast growth at 37°C 

To test for the importance of the conserved region located at the 
N-terminal extremity of ScPatl domain, we have used comple- 
mentation of Patl mutants (Fig.3). Since Patl is known to 
collaborate with other factors to induce mRNA decay and repress 
translation, we first constructed strains carrying deletion of PATI 
alone or in combination with DHIil, or with EDC3 and SCD6 
deletions. Consistent with previous observation, deletion of PATI 
resulted in slow growth phenotype at 25°C or 30°C and 
thermosensitivity at 37°C (Fig.SB; [15,16,19]). Exacerbated 
phenotypes, supporting pardy redundant functions, were observed 
in both the edc3A/scd6A/patlA and dhhlA/patlA strains. This is 
consistent with previous observations and with the participation of 
these factors in translation repression and/or mRNA decay [64]. 
Full-length Patl protein fuUy complements for the deletion of the 
PATI gene in all these strains while a Patl fragment lacking the 
cr>'stallized ScPatIC domain (Patl [1-472] or PatlAC) does not, 
indicating that this C-terminal domain is functionally important 
(Fig.3B). Based on this observation, we next mutated residues 
located within the conserved N-terminal region from ScPatIC by 
substituting lysine and arginine residues from helices al and a4 by 
glutamic acid (Fig. 3 A). This yielded three charge-inversion 
mutants (K475E/K476E, R497E and K531E/K534E/R538E) 
whose ability to complement for PATI deletion was tested in the 
three different genetic backgrounds. While the R497E and 
K475E/K476E mutants behave mosdy as the wild-type ScPatl 
protein, the K531E/K534E/R538E mutant exhibits a slighdy 
reduced complementation in the patl A strain (detected as smaller 
colonies), and does not complement for PATI deletion in dhhlA 
and edc3 A / scdGA strains (Fig.3B). Furthermore, as no complemen- 
tation defects are detectable at 25^0 or 3()°C for these three 
mutants, we conclude that these mutants are functional and well 
folded. This was confirmed by normal accumulation of these 
different point mutant proteins in yeast cells (Fig.S2). Altogether, 
these complementation analyses indicate that the positively 
charged conserved region and in particular K531, K534 and 
R538 residues, found at the N-terminal region of ScPatIC play an 
important functional role, that is more easily detected in the 
absence of the Dhhl helicase or of the Edc3 and Scd6 proteins. 

These functionally important conserved residues are located 
within the ScPatIC region interacting with the Lsml-7 complex 
[39,40]. As the charge inversion mutants that we have tested in 
our phenotypic analyses were not generated previously (only 
substitutions of some of these residues by Ala were performed 
[39,40]), we have investigated whether the complementation 
defects observed for the ScPatIC mutants could result from the 
inability of our mutants to interact with Lsml-7 complex. We have 
purified the Lsml-7 complex as well as various ScPatIC mutants 
following over-expression in E. coli and performed puU-down 
assays. As shown in Fig.3C, the untagged Lsml-7 complex was 
specifically retained on NiNTA beads when incubated with the 
wild-type Hisg tagged ScPatl fragment encompassing residues 
435-796 {i.e. slighdy longer than ScPatIC). Similarly, the R497E 
mutant as well as two other mutants targeting residues located 
within the opposite C-terminal extension present in Patl fungal 
proteins (Ci706A/L7 1 3A and Q,720A/R721A/D725A/R728A) 
interact to the same extent as the wild-type ScPatl domain. 
Interestingly, die K475E/K476E and K531E/K534E/R538E 
ScPatl mutants failed to interact with the Lsml-7 complex, 
confirming that the conserved positively charged residues (K475, 
K476, K531, K534 and R538) located within this N-terminal 
region from ScPatIC are direcdy involved in the interaction with 
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Figure 2. Sequence alignment of Pati orthologues. Alignment was performed using ClustalW [69]. Strictly conserved residues are in white on a 
black background. Partially conserved amino acids are boxed. Secondary structure elements assigned from the ScPatIC structure are indicated above 
the alignment. Black stars below the sequences indicate residues mutated in this study. This figure was generated using the ESPript server [70]. 
doi:10.1371/journal.pone.0096828.g002 



tlie Lsml-7 complex. Furthermore, in the case of the K531E/ 
K534E/R538E ScPatl mutam, the disruption of Lsml-7-PatlC 
interaction suggests that this interaction is crucial for yeast growth 
at 37°C in the absence of Scd6 and Edc3 proteins (Fig.3B). 
Although we did not detect interaction between K475E/K476E 
ScPatl C mutant and the Lsml-7 complex in vitro, this mutant 
complements for the deletion of PATI gene in the difiFerent 
background tested. This could indicate that this later mutant 
retains some residual binding activity towards the Lsml-7 complex 
in vivo and this would be sufRcient to support yeast growth at 37°C 
(see discussion). 

The yeast specific C-terminal extension from ScPatl C is 
involved in Edc3 binding 

We next investigated whether the second conserved region, i.e. 
the fungi specific C-terminal extension is important for Pati 
biological function (Fig. 4). We used the same yeast strains to assess 
the ability of ScPatl lacking residues 730 to 796 from this C- 



terminal extension (ScPatl AC 68) to complement for the deletion 
oi PATI gene (Fig.4B). This truncated protein was unable to fully 
restore growth at 37°C upon expression in all mutant strains with 
the strongest phenotypes again observed in the patlA/scd6A/edc3A 
and patlA/dhhlA strains (Fig.4B). A patlA strain carrying a 
deletion of the ScPatl C-terminal extension grows only slightly 
better than a strain with Pati AC and both deletions have similar 
effects in the patlA/dhhlA axiApatlA/scd6A/edc3A backgrounds. 
Hence, our data support that this C-terminal extension plays a 
functional role, whose importance is better detected in the more 
sensitive double mutant strains. 

We next investigated the importance of well-conserved residues 
within this region by substituting them by Ala to yield three 
different mutants: Q706A/L713A, Q720A/R721A/D725A/ 
R728A and E794A (Fig.4A). All these mutants were expressed at 
similar levels as the wild-type protein in yeast (Fig.S2) and fully 
complemented for PATI deletion in the three backgrounds tested 
indicating that they are not affected in their function. Surprisingly, 
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Figure 3. The conserved residues at the N-terminal residues are 
involved in interaction with Lsm1-7. A. ScPatIC structure with the 
residues mutated in this study shown as ball and sticks. Color scheme 
identical to Fig.lA. B. Growth analysis of PATI mutants. Serial dilutions 
of the different strains transformed with vector, a plasmid encoding 
wild type ScPatl, or the indicated mutant thereof, were deposited on 
plates and incubated at the indicated temperatures. C. Pull-down 
experiment of untagged Lsm1-7 complex with Hiss ScPatIC wild type 
or mutants. Input (top) and eluted (bottom) samples were analyzed by 
22% SDS-PAGE and Coomassie Blue staining. The asterisk denotes a 
contaminant protein that co-purifies with the Lsm1-7 complex but 
which is not retained on the HisPur™ Ni-NTA Magnetic Beads used for 
these pull-down experiments. Molecular weights (kDa) of the markers 
are indicated on the left of the gels. 
doi:10.1371/journal.pone.0096828.g003 

the patlA/scd6A/edc3A and patlA/dhhlA .strains expressing the 
Q706A/L713A and C1720A/R721A/D725A/R728A mutants 
grew faster at 37°C than the same strains expressing full-length 
wild-type Patl (Fig.4B). 

To go deeper into the role of this Patl fungi specific C-terminal 
extension, we tested the ability of full-length ScPatl and 
ScPatlAC68 to interact with Rps28, Edc3 and Scd6 in a two- 
hybrid assay (Fig.4C). In a wild type strain background, we could 
detect the Patl-Edc3 interaction that was previously reported [21] 
as well as positive signals for Patl-Rps28 and Patl-Scd6. The 
association with Rps28 and Scd6 were not previously observed 
using the two-hybrid assay but independent evidences support our 
observations. Indeed, an interaction between Patl and Scd6 was 
observed using purified factors [26]. Similarly, Rps28 is known to 
interact with Edc3 which itself binds Patl [30]. Interestingly, 
neither Edc3 nor Rps28 interact with the ScPatl AC68 protein 
while Scd6 interacts as efiicientiy as with the full-length Patl 
protein (Fig.4C). The latter observation indicates that 
ScPatlAC68 is well folded and functional. In the Aedc3 host 
strain, Rps28 was unable to interact with wild type ScPatl 
revealing thereby that Edc3 is necessary to bridge the two factors. 
We observed that the Patl-Edc3 interaction signal was much 
stronger (more than 60 fold) in the Aedc3 strain compared to wild 
type. This indicates that endogenous Edc3 competes with Edc3 
fused to the DNA binding domain for interacting with Pat 1 . Even 
with the very high signal observed in this context, deletion of the 
C-terminal region of Patl totally abolished the interaction with 
Edc3 (Fig.4C). The Scd6-Patl interaction signal was only slighdy 
enhanced in the l^edcS strain (3-fold) and remained strongly 
positive with the ScPatl AC 68 in this context. Thus, Scd6 interacts 
with Patl through an interface that remains to be identified but 
that differs at least in part from the one mediating Edc3 binding, 
as it does not require the fungal C-terminal extension of Patl. 
Consistently, Edc3 competed only weakly with the Scd6-Patl 
interaction. 

In summary, these data demonstrate that the C-terminal 
extension specific to fungal Patl proteins plays an important 
functional role, probably through its ability to mediate interaction 
with protein partners including Edc3. 

Discussion 

The Patl protein is involved in different aspects of mRNA 
decapping activation. Indeed, Patl represses translation initiation 
with its Dhhl and Scd6 partners [22,26,43] but also activates the 
Dcpl-Dcp2 enzyme together with Edc3 and, in yeast, Edcl and 
Edc2 [26,31]. Patl also interacts with the Lsml-7 complex, which 
binds to the 3' end of oligoadenylated mRNAs, thereby 
connecting 3' to 5' and 5' to 3' mRNA decay pathways 
[15,16,20,34,65]. Hence, Patl is considered as a central platform 
protein for decapping. Our present study further reinforces this 
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Figure 4. The yeast specific residues located at the C-terminus 
of ScPatIC are functionally important. A. ScPatK structure with 
the residues mutated in this study shown as ball and sticks. Color 
scheme identical to Fig.lA. The region absent in the ScPat1AC68 
construct is colored in grey. B. Growth analysis of PATI mutants. Serial 
dilutions of the different strains transformed with vector, a plasmid 



encoding wild type ScPatI, or the indicated mutant thereof, were 
deposited on plates and incubated at the indicated temperatures. C. 
IVlonitoring ScPatI interaction with Edc3, Rps28 or Scd6 using the two- 
hybrid assay. The wild-type or edcSA strain was transformed with the 
indicated pairs of vectors and interaction between the factors encoded 
by these two plasmids was scored by assaying p-galactosidase activity. 
doi:10.1371/journal.pone.0096828.g004 

idea. Indeed, the crystal structure of the S. cerevisiae Pat 1 C shows 
that this domain structurally belongs to the ARM repeat 
superfamUy, which mostly encompasses proteins involved in 
protein-protein interaction. This is consistent with already 
published data that have identified many protein partners for this 
domain (Lsml-7, Xrnl, Dcpl, Dcp2, Edc3 in yeast, human and/ 
or drosophila as well as EDC4 in human; [15,20,26,34,36]). 
Furthermore, sequence conservation mapping at the surface of 
ScPatIC domain reveals the presence of conserved residues 
clustered at both extremities of this domain and involved in 
protein recruitment by Patl. 

The first conserved region is located at the N-terminal part of 
the ScPat 1 C domain and the corresponding region from HPat was 
previously shown to be involved in the recruitment of the LSMl-7 
complex, DCP2 and EDC4 proteins by co-immunoprecipitation 
experiments performed in D. melanogaster cells [36]. Recently, two 
independent crystal structures of ScPatlC-Lsml-7 and ScPatlC- 
Lsm2-3 complexes clearly showed that this region (and in 
particular residues K475, K476, K531, K534 and R538) is 
involved in the interaction with Lsm3 within the Lsml-7 complex 
[39,40]. Furthermore, the mutation of some of these residues 
resulted in mRNA stabilization by altering mRNA decapping and 
3 '^5' decay in vivo [40]. We have observed that this N-terminal 
conserved region from PatlC is indeed of functional importance as 
illustrated by the thermosensitive phenotype observed for patl A/ 
scd6A /edc3A and patlA/dhhlA yeast strains expressing the Patl 
K531E/K,534E/R538E mutant (Fig.3B). We further confirmed 
that this ScPatl mutant, which was not generated in the other 
studies, fails to interact directly with the Lsml-7 complex using in 
vitro pull-down assays (Fig.3C). Contrary to the K531E/K534E/ 
R538E mutant, the K475E/K476E mutant does not interact with 
Lsml-7 complex in vitro but fully complements for the deletion of 
ScPatl in vivo (Fig.3B-C). This most probably results from the 
location of the K475 and K476 residues at the periphery of the 
ScPatlC-LsmI-7 interface while the K531, K534 and R538 
residues are located at the center of this interaction [39,40]. 
Indeed, one can speculate that the interaction between the Lsml-7 
complex and ScPatIC K475E/K476E mutant domain is too weak 
to resist the conditions of the in vitro pull-down assay performed 
with purified truncated Patl proteins while, in vivo, a residual 
interaction probably strengthened by the bridging activity of other 
decapping factors is likely to take place with this fuU-length ScPat 1 
mutant and to be sufficient for function. Altogether, these different 
studies rationalize the functional importance of these conserved 
Patl residues and indicate that the physical interaction between 
Patl and the Lsml-7 complex is crucial for yeast cell growth at 
37°C in the absence of Dhhl or Edc3 and Scd6 proteins. This 
further strongly suggests that the growth defects that we observe 
for the K531E/K534E/R538E mutant is most likely the result of 
the inability of this mutant to interact with Lsml-7 complex and 
then to activate mRNA decapping and 3 '^5' decay in vivo. 

Opposite to the ScPatIC region involved in the interaction with 
Lsml-7 complex, there is a fungi-specific C-terminal extension 
(encompassing residues 730-796 from ScPatl), which is important 
for growth at 25°C, 30°C and 37°C in the absence of the DHHl 
gene or of both EDC3 and SCD6 genes (Fig.4B). Expression of the 
ScPat 1AC68 protein in a Apatl background also results in a slight 
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growth defect at 37°C. Furthermore, we have observed using yeast 
two-hybrid that ScPatlAC68 no longer interacts with Edc3 and 
Rps28 (Fig.4C). The interaction with Rps28 requires Edc3, which 
itself interacts with Rps28 through a conserved linear motif [30]. 
However, the ScPatlAC68 protein still binds to Scd6, indicating 
that this truncated version of the protein is still present and folded 
in yeast. A previous report indicated that residues 697-763 from 
ScPatl are required for recruitment of Lsml-GFP to P-bodies and 
for the interaction of Patl with Edc3 but not Lsml [21]. As our 
structural information indicate that this internal deletion affects 
both the conserved and non-conserved regions of ScPat 1 , it was 
not possible to conclude about the role of the yeast specific region 
(residues 729 to 796) from these data. Our results indicate that it is 
required for Edc3 binding but it remains to be determined 
whether the Patl-Edc3 interaction is direct or bridged by other 
decapping factors. Indeed, ScPatl C domain was shown to interact 
physically with Dcp2, which in turn should interact with Edc3 Lsm 
domain through several short helical leucine-rich motifs (HLM) as 
shown in S. pombe [24,26]. Furthermore, no interaction between 
Edc3 and Patl proteins was detected by co-immunoprecipitation 
in D. melanogaster cells while the association described for human 
proteins was proposed to be indirect [20,34]. Hence, the 
involvement of the ScPatl C-terminal and fungal specific 
extension in the interaction with Edc3 is of particular interest. 
Other species-specific variations in the interaction network of 
decapping factors have been observed. For example, while the 
interaction between Dcpl and Dcp2 is direct in yeast, it is 
strengthened by EDC4/Hedls in human [32,33]. Similarly, the 
Edc3 and Scd6 Lsm domains interact with Dcp2 through short 
helical leucine-rich motifs (HLM) in S. pombe while in metazoan, 
the HLM motif involved in EDC3 binding is present in DCPl 
[24]. It has been proposed that phylogenetic variations in this 
protein interaction network arise through the appearance of Short 
Linear Interaction Motifs (SLiMs), which are intrinsically disor- 
dered when unbound [66] . While the appearance and evolution of 
such motifs are easy to envisage, our observation that yeast Patl 
proteins contain a C-terminal extension that is structured and 
required for Edc3 binding, indicates that evolution can also 
promote the formation of more complex and structured protein 
appendages that facilitate protein association. 

In conclusion, the crystal structure of the ScPatl C-terminal 
domain has re\calcd two functionally important regions localized 
at opposite' extremities of the domain. One region is enriched in 
positively charged residues and is important for yeast growth at 
37°C. It is involved in direct binding to the Lsml-7 complex that is 
required for decapping activation. The second region is required 
for the interaction of ScPatl with Edc3 and its precise role remains 
to be elucidated. 

Supporting Information 

Figure SI A. Superimposition of the five ScPatl C structures 
(each ScPatlG structure is shown with a different color). The rmsd 
values range from from 0.3 A to 1.2 A over 260-310 Ca atoms. B. 
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